Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 2130471 |
| Missing cells | 4357845 |
| Missing cells (%) | 12.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 276.3 MiB |
| Average record size in memory | 136.0 B |
Variable types
| NUM | 11 |
|---|---|
| CAT | 5 |
| BOOL | 1 |
data has a high cardinality: 410 distinct values | High cardinality |
municipio has a high cardinality: 5297 distinct values | High cardinality |
nomeRegiaoSaude has a high cardinality: 440 distinct values | High cardinality |
codmun is highly correlated with coduf and 1 other fields | High correlation |
coduf is highly correlated with codmun and 1 other fields | High correlation |
codRegiaoSaude is highly correlated with coduf and 1 other fields | High correlation |
obitosAcumulado is highly correlated with casosAcumulado and 1 other fields | High correlation |
casosAcumulado is highly correlated with obitosAcumulado and 1 other fields | High correlation |
obitosNovos is highly correlated with casosNovos | High correlation |
casosNovos is highly correlated with obitosNovos | High correlation |
Recuperadosnovos is highly correlated with casosAcumulado and 1 other fields | High correlation |
estado is highly correlated with regiao | High correlation |
regiao is highly correlated with estado | High correlation |
Recuperadosnovos has 2130114 (> 99.9%) missing values | Missing |
emAcompanhamentoNovos has 2130114 (> 99.9%) missing values | Missing |
populacaoTCU2019 is highly skewed (γ1 = 63.68529126) | Skewed |
casosAcumulado is highly skewed (γ1 = 98.21588694) | Skewed |
casosNovos is highly skewed (γ1 = 92.80026898) | Skewed |
obitosAcumulado is highly skewed (γ1 = 88.56965586) | Skewed |
obitosNovos is highly skewed (γ1 = 117.250021) | Skewed |
casosAcumulado has 275101 (12.9%) zeros | Zeros |
casosNovos has 1265420 (59.4%) zeros | Zeros |
obitosAcumulado has 691212 (32.4%) zeros | Zeros |
obitosNovos has 1977578 (92.8%) zeros | Zeros |
Reproduction
| Analysis started | 2021-04-10 18:44:26.679326 |
|---|---|
| Analysis finished | 2021-04-10 18:52:00.589797 |
| Duration | 7 minutes and 33.91 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 410 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.3 MiB |
| 2021-01-13 | 5620 |
|---|---|
| 2020-12-24 | 5620 |
| 2020-12-19 | 5619 |
| 2020-07-11 | 5619 |
| 2021-01-01 | 5619 |
| Other values (405) |
| Value | Count | Frequency (%) | |
| 2021-01-13 | 5620 | 0.3% | |
| 2020-12-24 | 5620 | 0.3% | |
| 2020-12-19 | 5619 | 0.3% | |
| 2020-07-11 | 5619 | 0.3% | |
| 2021-01-01 | 5619 | 0.3% | |
| 2020-10-23 | 5619 | 0.3% | |
| 2021-02-05 | 5619 | 0.3% | |
| 2020-05-02 | 5619 | 0.3% | |
| 2020-06-18 | 5619 | 0.3% | |
| 2020-05-23 | 5619 | 0.3% | |
| Other values (400) | 2074279 | 97.4% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.3 MiB |
| Nordeste | |
|---|---|
| Sudeste | |
| Sul | |
| Centro-Oeste | |
| Norte |
| Value | Count | Frequency (%) | |
| Nordeste | 687027 | 32.2% | |
| Sudeste | 635328 | 29.8% | |
| Sul | 453756 | 21.3% | |
| Centro-Oeste | 179391 | 8.4% | |
| Norte | 174557 | 8.2% | |
| Brasil | 412 | < 0.1% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 12 |
|---|---|
| Median length | 7 |
| Mean length | 6.727493122 |
| Min length | 3 |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 412 |
| Missing (%) | < 0.1% |
| Memory size | 16.3 MiB |
| MG | |
|---|---|
| SP | |
| RS | |
| BA | |
| PR | |
| Other values (22) |
| Value | Count | Frequency (%) | |
| MG | 324076 | 15.2% | |
| SP | 245244 | 11.5% | |
| RS | 189152 | 8.9% | |
| BA | 158832 | 7.5% | |
| PR | 152010 | 7.1% | |
| SC | 112594 | 5.3% | |
| GO | 94023 | 4.4% | |
| PI | 85685 | 4.0% | |
| PB | 85306 | 4.0% | |
| MA | 83032 | 3.9% | |
| Other values (17) | 600105 | 28.2% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.000193384 |
| Min length | 2 |
| Distinct | 5297 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 19441 |
| Missing (%) | 0.9% |
| Memory size | 16.3 MiB |
| Bom Jesus | 1895 |
|---|---|
| São Domingos | 1895 |
| São Francisco | 1516 |
| Santa Terezinha | 1516 |
| Bonito | 1516 |
| Other values (5292) |
| Value | Count | Frequency (%) | |
| Bom Jesus | 1895 | 0.1% | |
| São Domingos | 1895 | 0.1% | |
| São Francisco | 1516 | 0.1% | |
| Santa Terezinha | 1516 | 0.1% | |
| Bonito | 1516 | 0.1% | |
| Santa Inês | 1516 | 0.1% | |
| Vera Cruz | 1516 | 0.1% | |
| Santa Helena | 1516 | 0.1% | |
| Planalto | 1516 | 0.1% | |
| Santa Luzia | 1516 | 0.1% | |
| Other values (5287) | 2095112 | 98.3% | |
| (Missing) | 19441 | 0.9% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 32 |
|---|---|
| Median length | 10 |
| Mean length | 11.52917266 |
| Min length | 3 |
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.35978382 |
|---|---|
| Minimum | 11 |
| Maximum | 76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 25 |
| median | 31 |
| Q3 | 41 |
| 95-th percentile | 51 |
| Maximum | 76 |
| Range | 65 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 9.876236041 |
|---|---|
| Coefficient of variation (CV) | 0.3052009276 |
| Kurtosis | -0.4680371154 |
| Mean | 32.35978382 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.1587902939 |
| Sum | 68941581 |
| Variance | 97.54003833 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=28)
| Value | Count | Frequency (%) | |
| 31 | 324076 | 15.2% | |
| 35 | 245244 | 11.5% | |
| 43 | 189152 | 8.9% | |
| 29 | 158832 | 7.5% | |
| 41 | 152010 | 7.1% | |
| 42 | 112594 | 5.3% | |
| 52 | 94023 | 4.4% | |
| 22 | 85685 | 4.0% | |
| 25 | 85306 | 4.0% | |
| 21 | 83032 | 3.9% | |
| Other values (18) | 600517 | 28.2% |
| Value | Count | Frequency (%) | |
| 11 | 20497 | 1.0% | |
| 12 | 8748 | 0.4% | |
| 13 | 23908 | 1.1% | |
| 14 | 6474 | 0.3% | |
| 15 | 54986 | 2.6% |
| Value | Count | Frequency (%) | |
| 76 | 412 | < 0.1% | |
| 53 | 789 | < 0.1% | |
| 52 | 94023 | 4.4% | |
| 51 | 54228 | 2.5% | |
| 50 | 30351 | 1.4% |
| Distinct | 5591 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 11482 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 325258.0141 |
|---|---|
| Minimum | 110000 |
| Maximum | 530010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 110000 |
|---|---|
| 5-th percentile | 150770 |
| Q1 | 251200 |
| median | 314610 |
| Q3 | 411915 |
| 95-th percentile | 510730 |
| Maximum | 530010 |
| Range | 420010 |
| Interquartile range (IQR) | 160715 |
Descriptive statistics
| Standard deviation | 98535.04745 |
|---|---|
| Coefficient of variation (CV) | 0.3029442571 |
| Kurtosis | -0.5267074737 |
| Mean | 325258.0141 |
| Median Absolute Deviation (MAD) | 74150 |
| Skewness | 0.122032906 |
| Sum | 6.892181541e+11 |
| Variance | 9709155576 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 412350 | 379 | < 0.1% | |
| 350640 | 379 | < 0.1% | |
| 292273 | 379 | < 0.1% | |
| 313140 | 379 | < 0.1% | |
| 210923 | 379 | < 0.1% | |
| 410304 | 379 | < 0.1% | |
| 270560 | 379 | < 0.1% | |
| 251272 | 379 | < 0.1% | |
| 314350 | 379 | < 0.1% | |
| 353590 | 379 | < 0.1% | |
| Other values (5581) | 2115199 | 99.3% | |
| (Missing) | 11482 | 0.5% |
| Value | Count | Frequency (%) | |
| 110000 | 379 | < 0.1% | |
| 110001 | 379 | < 0.1% | |
| 110002 | 379 | < 0.1% | |
| 110003 | 379 | < 0.1% | |
| 110004 | 379 | < 0.1% |
| Value | Count | Frequency (%) | |
| 530010 | 379 | < 0.1% | |
| 522230 | 379 | < 0.1% | |
| 522220 | 379 | < 0.1% | |
| 522205 | 379 | < 0.1% | |
| 522200 | 379 | < 0.1% |
| Distinct | 450 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 19441 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32403.1237 |
|---|---|
| Minimum | 11001 |
| Maximum | 53001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 11001 |
|---|---|
| 5-th percentile | 15012 |
| Q1 | 25010 |
| median | 31059 |
| Q3 | 41015 |
| 95-th percentile | 51013 |
| Maximum | 53001 |
| Range | 42000 |
| Interquartile range (IQR) | 16005 |
Descriptive statistics
| Standard deviation | 9836.343636 |
|---|---|
| Coefficient of variation (CV) | 0.3035615865 |
| Kurtosis | -0.5240045909 |
| Mean | 32403.1237 |
| Median Absolute Deviation (MAD) | 7056 |
| Skewness | 0.1399733425 |
| Sum | 6.840396622e+10 |
| Variance | 96753656.13 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 22009 | 15918 | 0.7% | |
| 24006 | 14023 | 0.7% | |
| 50001 | 12886 | 0.6% | |
| 43016 | 12507 | 0.6% | |
| 50003 | 12507 | 0.6% | |
| 26003 | 12128 | 0.6% | |
| 31007 | 12128 | 0.6% | |
| 22004 | 11749 | 0.6% | |
| 41015 | 11370 | 0.5% | |
| 42001 | 11370 | 0.5% | |
| Other values (440) | 1984444 | 93.1% | |
| (Missing) | 19441 | 0.9% |
| Value | Count | Frequency (%) | |
| 11001 | 3411 | 0.2% | |
| 11002 | 2274 | 0.1% | |
| 11003 | 5306 | 0.2% | |
| 11004 | 1895 | 0.1% | |
| 11005 | 3032 | 0.1% |
| Value | Count | Frequency (%) | |
| 53001 | 379 | < 0.1% | |
| 52018 | 3032 | 0.1% | |
| 52017 | 4548 | 0.2% | |
| 52016 | 3790 | 0.2% | |
| 52015 | 6822 | 0.3% |
| Distinct | 440 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 19441 |
| Missing (%) | 0.9% |
| Memory size | 16.3 MiB |
| CENTRAL | 21982 |
|---|---|
| SUL | 16676 |
| VALE DO RIO GUARIBAS | 15918 |
| 6ª REGIAO DE SAUDE - PAU DOS FERROS | 14023 |
| NORTE | 13265 |
| Other values (435) |
| Value | Count | Frequency (%) | |
| CENTRAL | 21982 | 1.0% | |
| SUL | 16676 | 0.8% | |
| VALE DO RIO GUARIBAS | 15918 | 0.7% | |
| 6ª REGIAO DE SAUDE - PAU DOS FERROS | 14023 | 0.7% | |
| NORTE | 13265 | 0.6% | |
| CAMPO GRANDE | 12886 | 0.6% | |
| REGIAO 16 | 12507 | 0.6% | |
| DOURADOS | 12507 | 0.6% | |
| CARUARU | 12128 | 0.6% | |
| POUSO ALEGRE | 12128 | 0.6% | |
| Other values (430) | 1967010 | 92.3% | |
| (Missing) | 19441 | 0.9% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 42 |
|---|---|
| Median length | 11 |
| Mean length | 13.34476836 |
| Min length | 3 |
semanaEpi
Real number (ℝ≥0)
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.71391209 |
|---|---|
| Minimum | 1 |
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 14 |
| median | 26 |
| Q3 | 40 |
| 95-th percentile | 51 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 15.25448307 |
|---|---|
| Coefficient of variation (CV) | 0.5710314167 |
| Kurtosis | -1.206169529 |
| Mean | 26.71391209 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.04191808142 |
| Sum | 56913215 |
| Variance | 232.6992536 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 14 | 73047 | 3.4% | |
| 13 | 50711 | 2.4% | |
| 10 | 39529 | 1.9% | |
| 12 | 39529 | 1.9% | |
| 11 | 39529 | 1.9% | |
| 9 | 39473 | 1.9% | |
| 52 | 39334 | 1.8% | |
| 2 | 39334 | 1.8% | |
| 39 | 39333 | 1.8% | |
| 38 | 39333 | 1.8% | |
| Other values (43) | 1691319 | 79.4% |
| Value | Count | Frequency (%) | |
| 1 | 39333 | 1.8% | |
| 2 | 39334 | 1.8% | |
| 3 | 39333 | 1.8% | |
| 4 | 39333 | 1.8% | |
| 5 | 39333 | 1.8% |
| Value | Count | Frequency (%) | |
| 53 | 39333 | 1.8% | |
| 52 | 39334 | 1.8% | |
| 51 | 39333 | 1.8% | |
| 50 | 39333 | 1.8% | |
| 49 | 39333 | 1.8% |
| Distinct | 5104 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 7959 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 118909.4324 |
|---|---|
| Minimum | 781 |
| Maximum | 210147125 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 781 |
|---|---|
| 5-th percentile | 2515 |
| Q1 | 5474 |
| median | 11708 |
| Q3 | 25768 |
| 95-th percentile | 122859 |
| Maximum | 210147125 |
| Range | 210146344 |
| Interquartile range (IQR) | 20294 |
Descriptive statistics
| Standard deviation | 3058444.129 |
|---|---|
| Coefficient of variation (CV) | 25.72078655 |
| Kurtosis | 4324.374913 |
| Mean | 118909.4324 |
| Median Absolute Deviation (MAD) | 7546 |
| Skewness | 63.68529126 |
| Sum | 2.523866971e+11 |
| Variance | 9.354080488e+12 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 5237 | 1895 | 0.1% | |
| 3203 | 1516 | 0.1% | |
| 7642 | 1516 | 0.1% | |
| 3802 | 1516 | 0.1% | |
| 4573 | 1516 | 0.1% | |
| 11019 | 1516 | 0.1% | |
| 18895 | 1137 | 0.1% | |
| 25216 | 1137 | 0.1% | |
| 4786 | 1137 | 0.1% | |
| 5348 | 1137 | 0.1% | |
| Other values (5094) | 2108489 | 99.0% | |
| (Missing) | 7959 | 0.4% |
| Value | Count | Frequency (%) | |
| 781 | 379 | < 0.1% | |
| 837 | 379 | < 0.1% | |
| 935 | 379 | < 0.1% | |
| 1034 | 379 | < 0.1% | |
| 1112 | 379 | < 0.1% |
| Value | Count | Frequency (%) | |
| 210147125 | 412 | < 0.1% | |
| 45919049 | 410 | < 0.1% | |
| 21168791 | 410 | < 0.1% | |
| 17264943 | 410 | < 0.1% | |
| 14873064 | 410 | < 0.1% |
| Distinct | 35575 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2693.212752 |
|---|---|
| Minimum | 0 |
| Maximum | 13373174 |
| Zeros | 275101 |
| Zeros (%) | 12.9% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 14 |
| median | 118 |
| Q3 | 464 |
| 95-th percentile | 3103 |
| Maximum | 13373174 |
| Range | 13373174 |
| Interquartile range (IQR) | 450 |
Descriptive statistics
| Standard deviation | 88095.9064 |
|---|---|
| Coefficient of variation (CV) | 32.71034058 |
| Kurtosis | 11168.33667 |
| Mean | 2693.212752 |
| Median Absolute Deviation (MAD) | 118 |
| Skewness | 98.21588694 |
| Sum | 5737811666 |
| Variance | 7760888725 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 275101 | 12.9% | |
| 1 | 56867 | 2.7% | |
| 2 | 34366 | 1.6% | |
| 3 | 25336 | 1.2% | |
| 4 | 21198 | 1.0% | |
| 5 | 19458 | 0.9% | |
| 6 | 16625 | 0.8% | |
| 7 | 14480 | 0.7% | |
| 8 | 13455 | 0.6% | |
| 9 | 12255 | 0.6% | |
| Other values (35565) | 1641330 | 77.0% |
| Value | Count | Frequency (%) | |
| 0 | 275101 | 12.9% | |
| 1 | 56867 | 2.7% | |
| 2 | 34366 | 1.6% | |
| 3 | 25336 | 1.2% | |
| 4 | 21198 | 1.0% |
| Value | Count | Frequency (%) | |
| 13373174 | 1 | < 0.1% | |
| 13279857 | 1 | < 0.1% | |
| 13193205 | 1 | < 0.1% | |
| 13100580 | 1 | < 0.1% | |
| 13013601 | 1 | < 0.1% |
| Distinct | 3984 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.83129224 |
|---|---|
| Minimum | -13915 |
| Maximum | 100158 |
| Zeros | 1265420 |
| Zeros (%) | 59.4% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | -13915 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 26 |
| Maximum | 100158 |
| Range | 114073 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 588.0277341 |
|---|---|
| Coefficient of variation (CV) | 31.22609573 |
| Kurtosis | 10364.5316 |
| Mean | 18.83129224 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 92.80026898 |
| Sum | 40119522 |
| Variance | 345776.6161 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 1265420 | 59.4% | |
| 1 | 215485 | 10.1% | |
| 2 | 112453 | 5.3% | |
| 3 | 74466 | 3.5% | |
| 4 | 54399 | 2.6% | |
| 5 | 42307 | 2.0% | |
| 6 | 33467 | 1.6% | |
| 7 | 27251 | 1.3% | |
| 8 | 23486 | 1.1% | |
| 9 | 19665 | 0.9% | |
| Other values (3974) | 262072 | 12.3% |
| Value | Count | Frequency (%) | |
| -13915 | 1 | < 0.1% | |
| -7926 | 1 | < 0.1% | |
| -3684 | 1 | < 0.1% | |
| -2977 | 1 | < 0.1% | |
| -2532 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 100158 | 1 | < 0.1% | |
| 93317 | 1 | < 0.1% | |
| 92625 | 1 | < 0.1% | |
| 91097 | 1 | < 0.1% | |
| 90638 | 1 | < 0.1% |
| Distinct | 7785 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 73.57465884 |
|---|---|
| Minimum | 0 |
| Maximum | 348718 |
| Zeros | 691212 |
| Zeros (%) | 32.4% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 9 |
| 95-th percentile | 71 |
| Maximum | 348718 |
| Range | 348718 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 2322.288223 |
|---|---|
| Coefficient of variation (CV) | 31.56369679 |
| Kurtosis | 9235.907096 |
| Mean | 73.57465884 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 88.56965586 |
| Sum | 156748677 |
| Variance | 5393022.59 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 691212 | 32.4% | |
| 1 | 250886 | 11.8% | |
| 2 | 168914 | 7.9% | |
| 3 | 123510 | 5.8% | |
| 4 | 98472 | 4.6% | |
| 5 | 76608 | 3.6% | |
| 6 | 65148 | 3.1% | |
| 7 | 53218 | 2.5% | |
| 8 | 42939 | 2.0% | |
| 9 | 37347 | 1.8% | |
| Other values (7775) | 522217 | 24.5% |
| Value | Count | Frequency (%) | |
| 0 | 691212 | 32.4% | |
| 1 | 250886 | 11.8% | |
| 2 | 168914 | 7.9% | |
| 3 | 123510 | 5.8% | |
| 4 | 98472 | 4.6% |
| Value | Count | Frequency (%) | |
| 348718 | 1 | < 0.1% | |
| 345025 | 1 | < 0.1% | |
| 340776 | 1 | < 0.1% | |
| 336947 | 1 | < 0.1% | |
| 332752 | 1 | < 0.1% |
| Distinct | 692 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4910435298 |
|---|---|
| Minimum | -292 |
| Maximum | 4249 |
| Zeros | 1977578 |
| Zeros (%) | 92.8% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | -292 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 4249 |
| Range | 4541 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 16.36046199 |
|---|---|
| Coefficient of variation (CV) | 33.31774273 |
| Kurtosis | 19853.74739 |
| Mean | 0.4910435298 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 117.250021 |
| Sum | 1046154 |
| Variance | 267.6647166 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 1977578 | 92.8% | |
| 1 | 92690 | 4.4% | |
| 2 | 21044 | 1.0% | |
| 3 | 8929 | 0.4% | |
| 4 | 4878 | 0.2% | |
| -1 | 3720 | 0.2% | |
| 5 | 3338 | 0.2% | |
| 6 | 2269 | 0.1% | |
| 7 | 1669 | 0.1% | |
| 8 | 1310 | 0.1% | |
| Other values (682) | 13046 | 0.6% |
| Value | Count | Frequency (%) | |
| -292 | 1 | < 0.1% | |
| -238 | 1 | < 0.1% | |
| -221 | 1 | < 0.1% | |
| -111 | 1 | < 0.1% | |
| -75 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4249 | 1 | < 0.1% | |
| 4195 | 1 | < 0.1% | |
| 3869 | 1 | < 0.1% | |
| 3829 | 1 | < 0.1% | |
| 3780 | 1 | < 0.1% |
| Distinct | 357 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2130114 |
| Missing (%) | > 99.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4587084.748 |
|---|---|
| Minimum | 22130 |
| Maximum | 11791885 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 22130 |
|---|---|
| 5-th percentile | 54554 |
| Q1 | 1321036 |
| median | 4568813 |
| Q3 | 7167651 |
| 95-th percentile | 10526727.6 |
| Maximum | 11791885 |
| Range | 11769755 |
| Interquartile range (IQR) | 5846615 |
Descriptive statistics
| Standard deviation | 3416743.181 |
|---|---|
| Coefficient of variation (CV) | 0.7448615773 |
| Kurtosis | -1.014059367 |
| Mean | 4587084.748 |
| Median Absolute Deviation (MAD) | 2934539 |
| Skewness | 0.2887360224 |
| Sum | 1637589255 |
| Variance | 1.167413396e+13 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 7709602 | 1 | < 0.1% | |
| 4470163 | 1 | < 0.1% | |
| 3671128 | 1 | < 0.1% | |
| 6354972 | 1 | < 0.1% | |
| 5189529 | 1 | < 0.1% | |
| 9323696 | 1 | < 0.1% | |
| 10601658 | 1 | < 0.1% | |
| 7144011 | 1 | < 0.1% | |
| 9281018 | 1 | < 0.1% | |
| 325395 | 1 | < 0.1% | |
| Other values (347) | 347 | < 0.1% | |
| (Missing) | 2130114 | > 99.9% |
| Value | Count | Frequency (%) | |
| 22130 | 1 | < 0.1% | |
| 22991 | 1 | < 0.1% | |
| 24325 | 1 | < 0.1% | |
| 25318 | 1 | < 0.1% | |
| 26573 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 11791885 | 1 | < 0.1% | |
| 11732193 | 1 | < 0.1% | |
| 11664158 | 1 | < 0.1% | |
| 11558784 | 1 | < 0.1% | |
| 11436189 | 1 | < 0.1% |
| Distinct | 357 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2130114 |
| Missing (%) | > 99.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 622927.0588 |
|---|---|
| Minimum | 14062 |
| Maximum | 1317658 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 14062 |
|---|---|
| 5-th percentile | 69550.4 |
| Q1 | 427526 |
| median | 642131 |
| Q3 | 794182 |
| 95-th percentile | 1209815.2 |
| Maximum | 1317658 |
| Range | 1303596 |
| Interquartile range (IQR) | 366656 |
Descriptive statistics
| Standard deviation | 297312.6697 |
|---|---|
| Coefficient of variation (CV) | 0.4772832798 |
| Kurtosis | -0.06565441078 |
| Mean | 622927.0588 |
| Median Absolute Deviation (MAD) | 179567 |
| Skewness | 0.06726410252 |
| Sum | 222384960 |
| Variance | 8.839482356e+10 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 464923 | 1 | < 0.1% | |
| 705020 | 1 | < 0.1% | |
| 325957 | 1 | < 0.1% | |
| 825232 | 1 | < 0.1% | |
| 48872 | 1 | < 0.1% | |
| 804035 | 1 | < 0.1% | |
| 563782 | 1 | < 0.1% | |
| 540692 | 1 | < 0.1% | |
| 895919 | 1 | < 0.1% | |
| 1296002 | 1 | < 0.1% | |
| Other values (347) | 347 | < 0.1% | |
| (Missing) | 2130114 | > 99.9% |
| Value | Count | Frequency (%) | |
| 14062 | 1 | < 0.1% | |
| 15015 | 1 | < 0.1% | |
| 16013 | 1 | < 0.1% | |
| 17533 | 1 | < 0.1% | |
| 19606 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1317658 | 1 | < 0.1% | |
| 1309541 | 1 | < 0.1% | |
| 1305248 | 1 | < 0.1% | |
| 1300185 | 1 | < 0.1% | |
| 1296002 | 1 | < 0.1% |
interior/metropolitana
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 19441 |
| Missing (%) | 0.9% |
| Memory size | 16.3 MiB |
| 0 | |
|---|---|
| 1 | 146294 |
| (Missing) | 19441 |
| Value | Count | Frequency (%) | |
| 0 | 1964736 | 92.2% | |
| 1 | 146294 | 6.9% | |
| (Missing) | 19441 | 0.9% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| data | regiao | estado | municipio | coduf | codmun | codRegiaoSaude | nomeRegiaoSaude | semanaEpi | populacaoTCU2019 | casosAcumulado | casosNovos | obitosAcumulado | obitosNovos | Recuperadosnovos | emAcompanhamentoNovos | interior/metropolitana | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2020-02-25 | Brasil | NaN | NaN | 76 | NaN | NaN | NaN | 9 | 210147125.0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 1 | 2020-02-26 | Brasil | NaN | NaN | 76 | NaN | NaN | NaN | 9 | 210147125.0 | 1 | 1 | 0 | 0 | NaN | NaN | NaN |
| 2 | 2020-02-27 | Brasil | NaN | NaN | 76 | NaN | NaN | NaN | 9 | 210147125.0 | 1 | 0 | 0 | 0 | NaN | NaN | NaN |
| 3 | 2020-02-28 | Brasil | NaN | NaN | 76 | NaN | NaN | NaN | 9 | 210147125.0 | 1 | 0 | 0 | 0 | NaN | NaN | NaN |
| 4 | 2020-02-29 | Brasil | NaN | NaN | 76 | NaN | NaN | NaN | 9 | 210147125.0 | 2 | 1 | 0 | 0 | NaN | NaN | NaN |
| 5 | 2020-03-01 | Brasil | NaN | NaN | 76 | NaN | NaN | NaN | 10 | 210147125.0 | 2 | 0 | 0 | 0 | NaN | NaN | NaN |
| 6 | 2020-03-02 | Brasil | NaN | NaN | 76 | NaN | NaN | NaN | 10 | 210147125.0 | 2 | 0 | 0 | 0 | NaN | NaN | NaN |
| 7 | 2020-03-03 | Brasil | NaN | NaN | 76 | NaN | NaN | NaN | 10 | 210147125.0 | 2 | 0 | 0 | 0 | NaN | NaN | NaN |
| 8 | 2020-03-04 | Brasil | NaN | NaN | 76 | NaN | NaN | NaN | 10 | 210147125.0 | 3 | 1 | 0 | 0 | NaN | NaN | NaN |
| 9 | 2020-03-05 | Brasil | NaN | NaN | 76 | NaN | NaN | NaN | 10 | 210147125.0 | 7 | 4 | 0 | 0 | NaN | NaN | NaN |
Last rows
| data | regiao | estado | municipio | coduf | codmun | codRegiaoSaude | nomeRegiaoSaude | semanaEpi | populacaoTCU2019 | casosAcumulado | casosNovos | obitosAcumulado | obitosNovos | Recuperadosnovos | emAcompanhamentoNovos | interior/metropolitana | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2130461 | 2021-03-31 | Centro-Oeste | DF | Brasília | 53 | 530010.0 | 53001.0 | DISTRITO FEDERAL | 13 | 3015268.0 | 344364 | 1253 | 6029 | 117 | NaN | NaN | 1.0 |
| 2130462 | 2021-04-01 | Centro-Oeste | DF | Brasília | 53 | 530010.0 | 53001.0 | DISTRITO FEDERAL | 13 | 3015268.0 | 345682 | 1318 | 6150 | 121 | NaN | NaN | 1.0 |
| 2130463 | 2021-04-02 | Centro-Oeste | DF | Brasília | 53 | 530010.0 | 53001.0 | DISTRITO FEDERAL | 13 | 3015268.0 | 346873 | 1191 | 6207 | 57 | NaN | NaN | 1.0 |
| 2130464 | 2021-04-03 | Centro-Oeste | DF | Brasília | 53 | 530010.0 | 53001.0 | DISTRITO FEDERAL | 13 | 3015268.0 | 348687 | 1814 | 6235 | 28 | NaN | NaN | 1.0 |
| 2130465 | 2021-04-04 | Centro-Oeste | DF | Brasília | 53 | 530010.0 | 53001.0 | DISTRITO FEDERAL | 14 | 3015268.0 | 349775 | 1088 | 6288 | 53 | NaN | NaN | 1.0 |
| 2130466 | 2021-04-05 | Centro-Oeste | DF | Brasília | 53 | 530010.0 | 53001.0 | DISTRITO FEDERAL | 14 | 3015268.0 | 351163 | 1388 | 6366 | 78 | NaN | NaN | 1.0 |
| 2130467 | 2021-04-06 | Centro-Oeste | DF | Brasília | 53 | 530010.0 | 53001.0 | DISTRITO FEDERAL | 14 | 3015268.0 | 352067 | 904 | 6449 | 83 | NaN | NaN | 1.0 |
| 2130468 | 2021-04-07 | Centro-Oeste | DF | Brasília | 53 | 530010.0 | 53001.0 | DISTRITO FEDERAL | 14 | 3015268.0 | 353206 | 1139 | 6532 | 83 | NaN | NaN | 1.0 |
| 2130469 | 2021-04-08 | Centro-Oeste | DF | Brasília | 53 | 530010.0 | 53001.0 | DISTRITO FEDERAL | 14 | 3015268.0 | 354816 | 1610 | 6609 | 77 | NaN | NaN | 1.0 |
| 2130470 | 2021-04-09 | Centro-Oeste | DF | Brasília | 53 | 530010.0 | 53001.0 | DISTRITO FEDERAL | 14 | 3015268.0 | 356558 | 1742 | 6676 | 67 | NaN | NaN | 1.0 |